Abstract



Until recently, techniques for obtaining lower bounds for kernelization were among the most sought-after tools in the field of parameterized complexity. Now, after a strong influx of techniques, we are in the fortunate situation of having tools available that are even stronger than what has been required in their applications so far. Based on a result of Fortnow and Santhanam (STOC 2008, JCSS 2011), Bodlaender et al. (ICALP 2008, JCSS 2009) showed that, unless NP $\subseteq$ coNP/poly, the existence of a deterministic polynomial-time composition algorithm, i.e., an algorithm which outputs an instance of bounded parameter value which is yes if and only if one of $t$ input instances is yes, rules out the existence of polynomial kernels for a problem. Dell and van Melkebeek (STOC 2010) continued this line of research and, amongst others, were able to rule out kernels of size $O(k^{d-\epsilon})$ for certain problems, assuming NP $\nsubseteq$ coNP/poly. It is an immediate consequence of their work that even the existence of a co-nondeterministic composition rules out polynomial kernels. However, in contrast to the numerous applications of deterministic composition, the added power of co-nondeterminism has not yet been harnessed to obtain kernelization lower bounds.

In this work we present the first example of how co-nondeterminism can help to make a composition algorithm. We study the existence of polynomial kernels for a Ramsey-type problem: Given a graph $G$ and an integer $k$, the question is whether $G$ contains an independent set or a clique of size at least $k$. It was asked by Rod Downey whether this problem admits a polynomial kernelization, and such a result would greatly speed up the computation of Ramsey numbers. We provide a co-nondeterministic composition based on embedding $t$ instances into a single host graph $H$. The crux is that the host graph $H$ needs to observe a bound of $\ell \in O(\log t)$ on both its maximum independent set and maximum clique size, while also having a cover of its vertex set by independent sets and cliques all of size $\ell$; the co-nondeterministic composition is built around the search for such graphs. Thus we show that, unless NP $\subseteq$ coNP/poly (and the polynomial hierarchy collapses), the problem does not admit a kernelization with polynomial size guarantee.

1 Introduction 

Parameterized complexity refines classical complexity by taking into account not only the size of a 
given input but also one or more additional parameters like solution size, or structural measures like 
various notions of width for graphs. The main positive result that one seeks to obtain, is to show 
that instances $(x, k)$ of a given NP-hard problem can be solved in time $O(f(k) \cdot |x|^c)$ where $f$ is a computable function and $c$ is a constant independent of $k$; this is called fixed-parameter tractability. It entails $O(|x|^c)$ algorithms for every bounded value of $k$. If the chosen parameter $k$ can be expected to be small in practice, then this is a strong improvement over a worst-case exponential time, e.g., $O(\alpha^{|x|})$, algorithm that one would otherwise have to resort to (given our current knowledge of P vs. NP and hypotheses like the exponential time hypothesis, cf. [22]).

Kernelization takes the perspective that if the chosen parameter k is small when compared to 
the size of a given instance $(x, k)$, then strong insights into the structure of the instance should
be possible which allow to discard large parts of x in polynomial time and leave an equivalent 
instance of size bounded by some function in k. Interestingly, by a folklore result, the problems 
with such a kernelization are exactly those in the class FPT of fixed-parameter tractable problems. 
This shows that kernelization is a robust definition of data reduction, which is not possible when 
taking into account only the input size (see also the discussion by Harnik and Naor [13] in a study 
of compression related to witness size). An important subclass of FPT is formed by those problems 
allowing kernelizations with size guarantee polynomial in k, capturing plenty of results with linear 
or quadratic size kernels, e.g., [SIlEllIO], but enjoying the good closure properties of polynomials. 

A nice feature of kernelization is that since many parameters can be well approximated, it is 
not necessary to follow up with an exact or FPT algorithm or even to adopt the framework of 
parameterized complexity in the first place. Since only polynomial time is invested to get the 
kernelized instance, it is just as valid to run an approximation, randomized, or heuristic algorithm 
afterwards. In fact, reduction rules have had fair use in other areas already and, e.g., primal-dual 
approximation techniques are quite related to standard arguments in kernelization which start from 
a packing of forbidden structures (see, for example, Paul et al. [18]).

Until recently, techniques for obtaining lower bounds for kernelization were one of the most 
sought after tools in the field of parameterized complexity (see, e.g., a 2007 survey of Guo and 
Niedermeier [12]). This was especially true for the threshold of whether or not a problem would
allow a polynomial kernel. Now, after a strong influx of techniques [21 \TT\ O [71 HI E], we are in 
the fortunate situation of having tools that are even stronger than what has been required in their 
applications so far. 

Let us take a high level view of the main technique for excluding polynomial kernels. The central piece is that of a composition algorithm which takes as input $t$ instances $(x_1, k), \dots, (x_t, k)$ and produces in polynomial time an instance $(y, k')$ which is yes if and only if at least one $(x_i, k)$ is yes, and, crucially, with $k'$ polynomially bounded in $k$. When combined with a polynomial kernelization this gives a distillation algorithm for the underlying classical problem which, given $x_1, \dots, x_t$, computes in polynomial time an instance $y$ which is yes if and only if at least one $x_i$ is yes, and whose size is polynomially bounded in the size of the largest $x_i$. The intuition of this framework given by Bodlaender et al. [2] is that when $t$ exceeds the size of $y$ (which is independent of $t$) then there will be less than one bit of information per instance; they conjectured that NP-hard problems do not have distillation algorithms. Fortnow and Santhanam [11] proved the conjecture to be true under the assumption that NP $\nsubseteq$ coNP/poly (known to otherwise cause a collapse of the polynomial hierarchy [23]). This led to a flurry of papers showing composition algorithms for various problems, e.g., [H [HI [SI [16] , and thus excluding polynomial kernelizations assuming NP $\nsubseteq$ coNP/poly.

By a generalization of the work of Fortnow and Santhanam [11], Dell and van Melkebeek [7] show that languages $L$ which have an oracle communication protocol for deciding instances $(x_1, \dots, x_t)$ of OR($L$) with (communication) cost $O(t \log t)$ are contained in coNP/poly; given $(x_1, \dots, x_t)$, the OR($L$) problem asks whether at least one $x_i$ is contained in $L$. They conclude that NP-hard languages $L$ do not have such protocols unless NP $\subseteq$ coNP/poly. Combined with an intricate packing lemma, this led to their main result that satisfiability of $d$-CNF formulas does not allow nontrivial sparsification, i.e., instances with $n$ variables cannot be compressed to size $O(n^{d-\epsilon})$. Amongst other things, they also obtain polynomial lower bounds for kernelization, e.g., non-existence of an $O(k^{2-\epsilon})$ sized kernel for Vertex Cover (all results assuming NP $\nsubseteq$ coNP/poly). Combining a polynomial kernelization and a composition algorithm naturally gives an oracle communication protocol [7].

An interesting new aspect in the lower bounds via oracle communication protocols (see Section 3 for a definition) is that the exclusion of protocols of cost $O(t \log t)$ holds, explicitly, even when the first player (holding the input and communicating with an all-powerful oracle) is allowed to behave co-nondeterministically [7]. The fact that co-nondeterminism can be allowed is already implicit in the work of Fortnow and Santhanam [11], as observed by Chen and Müller (cf. [13]). The key observation seems to be that, essentially, a kernel and a composition are used as subroutines in a coNP-machine for accepting an NP-hard language. Hence, relaxing the subroutines to co-nondeterministic behavior as well does not harm the properties of the accepting machine. To our knowledge, the only result so far making use of co-nondeterminism is the lower bound of $O(n^{d-\epsilon})$ on PCPs for $d$-SAT [7]. In particular, the implicit notion of co-nondeterministic composition is left largely unexplored, despite the high interest in a set of problems that have so far resisted a classification into admitting or not admitting a polynomial kernelization, e.g., Directed Feedback Vertex Set and Multiway Cut. Building on the work of Dell and van Melkebeek [7], recent work of Hermelin and Wu [14] defines a notion called weak composition which permits a larger dependence on the number $t$ of instances. They obtain concrete polynomial lower bounds in the style of Dell and van Melkebeek [7], i.e., for problems which admit some polynomial kernel. By definition, weak compositions allow co-nondeterminism, but the current results make no use of this option. Our co-nondeterministic composition excludes kernels of any polynomial size.

The Ramsey problem. Recently, Rod Downey posed the interesting question of whether the following combination of the well-known Clique and Independent Set problems admits a polynomial kernel [17]. We call it Ramsey($k$) for brevity.

RAMSEY($k$)

Input: A graph $G$ and an integer $k$.

Parameter: $k$.

Question: Does $G$ contain an independent set or a clique of size $k$?

Unlike Clique and Independent Set, the problem is FPT by a more general result of Khot and Raman [15] which uses Ramsey's Theorem: Let $R(k)$ denote the smallest integer $N$ such that each graph with $N$ vertices contains an independent set or a clique of size $k$; Ramsey showed these numbers to exist and to be computable [19]. If $G$ has more than $R(k)$ vertices, then the instance is yes. Otherwise, the number of possible solutions is bounded by $f(k) = \binom{R(k)}{k}$; since $R(k)$ is computable this suffices to prove fixed-parameter tractability (see Section 2 for explicit upper and lower bounds on $R(k)$). However, it is open whether or not there is a polynomial kernelization for it. The question of small kernels for the Ramsey($k$) problem is well-motivated: There are as of yet no efficient algorithms known for computing Ramsey numbers; a brute-force way is to check all non-isomorphic graphs on $N$ vertices for $k$-cliques or $k$-independent sets in order to determine whether $R(k) \le N$. The known bounds for $R(k)$ imply that this requires $N$ to be of order $O(\alpha^k)$, giving a runtime of $O(\alpha^{k^2})$ per graph (trying all sets of $k$ vertices). A polynomial kernelization which guarantees reduction to $O(k^c)$ vertices would yield runtime $O((k^c)^k) = O(\alpha^{k \log k})$ per graph.
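The per-graph brute-force test mentioned above (trying all sets of $k$ vertices) can be sketched as follows. This is only an illustration under our own assumptions: the function name `has_homogeneous_set` and the edge-list representation are hypothetical and not taken from the paper.

```python
from itertools import combinations

def has_homogeneous_set(n, edges, k):
    """Return True if the graph on vertices 0..n-1 contains a clique or an
    independent set of size k, by trying all k-subsets of the vertices."""
    edge_set = {frozenset(e) for e in edges}
    for subset in combinations(range(n), k):
        # For each pair inside the subset, record whether it is an edge.
        pairs = [frozenset(p) in edge_set for p in combinations(subset, 2)]
        if all(pairs) or not any(pairs):
            return True  # all pairs adjacent (clique) or none (independent set)
    return False

# The 5-cycle is the classical witness for R(3) > 5.
c5 = [(i, (i + 1) % 5) for i in range(5)]
print(has_homogeneous_set(5, c5, 3))
```

The loop inspects up to $\binom{n}{k}$ subsets, which is the source of the per-graph running times discussed in the text.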

Our work. Regarding polynomial kernelization for Ramsey($k$) we demonstrate two things. We disprove the existence of polynomial kernels for Ramsey($k$) unless NP $\subseteq$ coNP/poly. We thereby show for the first time how to exploit co-nondeterminism to construct a composition algorithm. It appears that the co-nondeterminism is necessary to realize our composition algorithm, since it involves detection of cliques and independent sets (see below).

Techniques and related work. Unlike for the problems Clique and $k$-Path [IHIE], the disjoint union of $t$ instances of Ramsey($k$) does not work satisfactorily as a composition algorithm (and neither would a join of the instances) as it would contain independent sets of size $\Omega(t)$. The intricate Packing Lemma due to Dell and van Melkebeek [7, Lemma 1], designed of course for a different purpose, does not seem to be applicable either as it constructs an $n$-partite graph containing independent sets of size $n$ which cannot be bounded in $O(\log t)$ when $t := t(n)$ is polynomially bounded. Generally, it appears to be unlikely that one could pack the instances in such a way that solutions are confined to a part representing a single original instance.

Our construction can best be motivated by a simplified example. Let $t = \ell^2$ instances of Ramsey($k$) be given, say, $(G_1, k), \dots, (G_t, k)$, and assume that each instance contains at least one independent set and one clique of size $k - 1$. We construct a graph $G'$ as follows: Let $G'$ contain copies of the graphs $G_1, \dots, G_t$, and pick an arbitrary partition of the graphs into $\ell$ groups of size $\ell$ each. Then add all edges between vertices of different graphs that are in the same group. Now, if all $t$ instances are no, then it can be verified that $G'$ contains no clique and no independent set of size greater than $\ell \cdot (k - 1)$: The reason is that any clique or independent set can contain vertices from at most $\ell$ graphs $G_i$ (each clique only from one group; each independent set only from one graph per group). If at least one instance is yes then its independent set or clique of size $k$ can be extended with $k - 1$ vertices of each of $\ell - 1$ other graphs; this gives a solution of size $\ell \cdot (k - 1) + 1$. Thus asking whether $G'$ has an independent set or clique of size at least $\ell \cdot (k - 1) + 1 \in O(\sqrt{t} \cdot k)$ is equivalent to whether at least one instance $(G_i, k)$ is yes. We mention in passing that such a composition excludes kernels of size $O(k^{2-\epsilon})$ by recent work of Hermelin and Wu [14], or by deriving an appropriate communication protocol and applying the mentioned result of Dell and van Melkebeek [7].
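The simplified example can be checked on tiny inputs. The sketch below is our own illustration, not code from the paper: graphs are adjacency dictionaries, groups are formed from consecutive indices (one arbitrary choice of partition), and the helper names `max_clique`, `complement`, `compose` and `homogeneous_number` are hypothetical.

```python
from itertools import combinations

def max_clique(adj):
    """Size of a largest clique; adj maps each vertex to its set of neighbours."""
    best = 0
    def grow(size, cand):
        nonlocal best
        best = max(best, size)
        cand = set(cand)
        # Stop once even taking every remaining candidate cannot beat `best`.
        while cand and size + len(cand) > best:
            v = cand.pop()
            grow(size + 1, cand & adj[v])
    grow(0, set(adj))
    return best

def complement(adj):
    return {v: set(adj) - adj[v] - {v} for v in adj}

def compose(graphs, ell):
    """Copies of all graphs; graphs in the same group of ell consecutive
    indices are joined, graphs in different groups stay non-adjacent."""
    adj = {(i, v): {(i, w) for w in g[v]} for i, g in enumerate(graphs) for v in g}
    for i, j in combinations(range(len(graphs)), 2):
        if i // ell == j // ell:
            for v in graphs[i]:
                for w in graphs[j]:
                    adj[(i, v)].add((j, w))
                    adj[(j, w)].add((i, v))
    return adj

def homogeneous_number(adj):
    """Maximum of the clique number and the independence number."""
    return max(max_clique(adj), max_clique(complement(adj)))

path = {0: {1}, 1: {0, 2}, 2: {1}}                      # no-instance for k = 3
tri_iso = {0: {1, 2}, 1: {0, 2}, 2: {0, 1}, 3: set()}   # yes-instance for k = 3
ell, k = 2, 3
all_no = homogeneous_number(compose([path] * 4, ell))
one_yes = homogeneous_number(compose([tri_iso, path, path, path], ell))
print(all_no, one_yes, ell * (k - 1) + 1)
```

Both sample graphs contain a clique and an independent set of size $k - 1 = 2$, as the example assumes; only the second one has a solution of size $k = 3$.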

The reader may have noticed that in the example we have connected the instances according to the complement of the Turán graph $T(t, \ell)$ which (for $t = \ell^2$) contains no independent set or clique of size greater than $\ell$. The other equally important feature of the Turán graph that we exploited is that each vertex is contained in both an independent set and a clique of size exactly $\ell$. This way the distinction whether or not any one graph $G_i$ has a solution of size $k$ (instead of just $k - 1$) makes the crucial difference for the instance $(G', \ell(k - 1) + 1)$. Motivated by this example the main work lies in finding a better host graph $H$ to replace $T(t, \ell)$ which has similar properties but with $\ell \in O(\log t)$. No deterministic construction is known for such graphs, despite fairly recent progress on deterministic construction of Ramsey graphs without cliques or independent sets of size $t^{\epsilon} + 1 = t^{o(1)}$ [1]. While $\ell = t^{\epsilon}$ can be seen to still exclude polynomial kernels (cf. Section 3), it seems unlikely that those graphs would support a cover with cliques and independent sets each of size $t^{\epsilon}$; also, our tighter logarithmic dependence on $t$ may have other consequences for kernels. We ensure the covering property by using gaps between Ramsey numbers $R(\ell)$ and $R(\ell + 1)$ when $\ell \in O(\log t)$. This in turn would require deterministic constructions for $O(\log t)$-Ramsey graphs, which is open.

Organization. In Section 2 we recall the necessary definitions, mention upper and lower bounds on Ramsey numbers, and introduce a refinement version of Ramsey($k$) which will be used for the composition. In Section 3 we state the required result of Dell and van Melkebeek [7], introduce the notion of co-nondeterministic composition which we will use, and show that this concept excludes polynomial kernels, assuming NP $\nsubseteq$ coNP/poly. In Section 4 we show an embedding of graphs into a host graph, motivated by the example using the edge complement of a Turán graph, but somewhat tweaked to lessen the restriction on the host graph. Section 5 then gives the co-nondeterministic composition and derives our main result. We conclude in Section 6.

2 Preliminaries 

Graphs. All graphs considered in this work are finite, simple, and undirected. By the join of two graphs (or two connected components), we mean the operation of adding all edges between vertices of different graphs (or components). With $\alpha(G)$ and $\omega(G)$ we denote the maximum size of independent sets or cliques in $G$, respectively.

Ramsey numbers. The Ramsey number $R(k)$ is the smallest integer such that every graph on $R(k)$ vertices contains a clique or an independent set of size $k$. Ramsey's Theorem [19] shows that this number is finite. Currently the best bounds on these diagonal Ramsey numbers are as follows: Providing an upper bound, Conlon [6] shows that there is a constant $D$, such that for sufficiently large $k \in \mathbb{N}$ we have

$$R(k + 1) \le k^{-D \frac{\log k}{\log \log k}} \binom{2k}{k}.$$

Spencer [20] shows with an application of Lovász' Local Lemma that

$$R(k) \ge (1 + o(1)) \frac{\sqrt{2}}{e} \, k \, 2^{k/2}.$$

Parameterized problems and kernels. A parameterized problem $Q$ over alphabet $\Sigma$ is a subset of $\Sigma^* \times \mathbb{N}$. The problem $Q$ is fixed-parameter tractable if there exists an algorithm $A$, a computable function $f$, and a constant $c$, such that $A$ decides membership in $Q$ for any instance $(x, k)$ in time $O(f(k) n^c)$, where $n = |x|$. The problem $Q$ admits a kernelization (or kernel) if there is a polynomial-time algorithm $K$ and a computable function $h$, such that $K$ transforms any instance $(x, k)$ into an equivalent instance $(x', k')$ with $|x'|, k' \le h(k)$. The function $h$ is called the size of the kernelization $K$ and we say $K$ is a polynomial kernelization if $h(k)$ is polynomially bounded.

Refinement version of Ramsey($k$). Instead of considering Ramsey($k$) directly, we focus on the following refinement version, in which the given graph is guaranteed to contain both a clique and an independent set of size $k - 1$ (for ease of notation we omit the details of giving the $(k - 1)$-sized independent set and clique in the input). Bodlaender et al. [2] use such problem variants to exclude, e.g., polynomial kernels for Independent Set parameterized by treewidth.

Refinement RAMSEY($k$)

Input: A graph $G$ and an integer $k$, such that $G$ has both an independent set and a clique of size $k - 1$.

Parameter: $k$.

Question: Does $G$ contain an independent set or a clique of size $k$?

A simple reduction from Ramsey($k$) to Refinement Ramsey($k$) which only increases the parameter by one shows that lower bounds transfer directly from the latter to the former problem; it is useful to note that instances for Refinement Ramsey($k$) are also legal for Ramsey($k$), and applying the latter gives the same answer. We will use this later to transfer our obtained lower bound from Refinement Ramsey($k$) to Ramsey($k$) (a more general argument for transferring lower bounds is due to Bodlaender et al. [5]).

Lemma 1. There is a polynomial-time reduction reducing any instance $(G, k)$ of Ramsey($k$) to an equivalent instance $(G', k + 1)$ of Refinement Ramsey($k$).

Proof. Given an instance $(G, k)$ of Ramsey($k$), and assuming w.l.o.g. that $k \ge 3$, construct $G'$ starting with a copy of $G$. Add a clique $C$ on $k - 1$ vertices to $G'$. Then add an independent set $I$ with $k$ vertices to $G'$ and make a join with all other vertices of $G'$ (in the copy of $G$ and in the clique $C$). Return $(G', k + 1)$.

If $G$ contains a $k$-clique, then in $G'$ a vertex of $I$ can be added to this clique to obtain a $(k + 1)$-clique; if it contains a $k$-independent set then in $G'$ a vertex of $C$ can be added. Conversely, if $G'$ has a $(k + 1)$-clique $C'$, then this clique contains at most one vertex of $I$. Furthermore $C'$ cannot intersect $C$, else it could contain no vertex of the copy of $G$, limiting its size to $k$ (including the one vertex of $I$); thus $C'$ contains a $k$-clique in the copy of $G$. Similarly, if $G'$ contains a $(k + 1)$-independent set $I'$ then it cannot contain vertices of $I$, otherwise it could contain no further vertices due to the join operation. Thus it contains at most one vertex of the clique $C$ and an independent set of size at least $k$ in the copy of $G$. Finally, we observe that $G'$ contains a $k$-independent set, namely $I$, and a $k$-clique, formed by $C$ plus an arbitrary vertex of $I$. This completes the proof. □
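The construction in the proof of Lemma 1 is short enough to try out directly. The following sketch is an illustration under our own assumptions (adjacency dictionaries, hypothetical names `to_refinement`, `max_clique`, `complement`); it builds $G'$ from a copy of $G$, a clique on $k - 1$ vertices, and an independent set on $k$ vertices joined to everything else.

```python
def max_clique(adj):
    """Size of a largest clique; adj maps each vertex to its set of neighbours."""
    best = 0
    def grow(size, cand):
        nonlocal best
        best = max(best, size)
        cand = set(cand)
        while cand and size + len(cand) > best:
            v = cand.pop()
            grow(size + 1, cand & adj[v])
    grow(0, set(adj))
    return best

def complement(adj):
    return {v: set(adj) - adj[v] - {v} for v in adj}

def to_refinement(adj, k):
    """Map an instance (G, k) to (G', k + 1) as in the proof of Lemma 1."""
    new = {("g", v): {("g", w) for w in adj[v]} for v in adj}   # copy of G
    clique = [("c", i) for i in range(k - 1)]                   # clique C
    indep = [("i", i) for i in range(k)]                        # independent set I
    for u in clique:
        new[u] = set(clique) - {u}
    others = list(new)            # copy of G together with C
    for u in indep:               # join I with all other vertices
        new[u] = set(others)
        for w in others:
            new[w].add(u)
    return new, k + 1

c5 = {i: {(i - 1) % 5, (i + 1) % 5} for i in range(5)}   # no-instance for k = 3
tri = {0: {1, 2}, 1: {0, 2}, 2: {0, 1}}                  # yes-instance for k = 3
g_no, k_no = to_refinement(c5, 3)
g_yes, k_yes = to_refinement(tri, 3)
print(max_clique(g_no), max_clique(complement(g_no)), k_no)
print(max_clique(g_yes), max_clique(complement(g_yes)), k_yes)
```

For the 5-cycle the output graph has a clique and an independent set of size exactly $k$ but none of size $k + 1$, so it is a legal no-instance of the refinement version; for the triangle the clique number reaches $k + 1$.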

We give a straightforward proof for NP-hardness of Refinement Ramsey($k$) and Ramsey($k$). This is a prerequisite for the lower bound tools.

Theorem 1. The problems Ramsey($k$) and Refinement Ramsey($k$) are NP-hard.

Proof. We give a reduction from Clique. Let $(G, k)$ be an instance of Clique, where $G$ has $n$ vertices. We construct a graph $G'$ by adding to $G$ a clique $C$ on $n + 1$ vertices, and adding all edges between the vertices of $G$ and $C$ (i.e., we perform a join operation on $G$ and $C$). We return $(G', k + n + 1)$ and claim that it is an equivalent instance of Ramsey($k$).

Clearly the maximum clique size $\omega(G')$ of $G'$ is equal to $\omega(G) + n + 1$. We note also that the maximum independent set size $\alpha(G')$ of $G'$ is at most $n$, since independent sets in $G'$ can either use the vertices of $G$ or a single vertex of the clique $C$.

Thus if $(G, k)$ is a yes-instance then $\omega(G) \ge k$ and $\omega(G') \ge k + n + 1$, and $(G', k + n + 1)$ is a yes-instance too. On the other hand, if $(G', k + n + 1)$ is a yes-instance then $\omega(G') \ge k + n + 1$ since $\alpha(G') \le n$, implying that $\omega(G) \ge k$ and that $(G, k)$ is a yes-instance. Thus Ramsey($k$) is NP-hard. NP-hardness of Refinement Ramsey($k$) now follows from Lemma 1. □
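The reduction from Clique in the proof of Theorem 1 can be sketched in the same style. The names `clique_to_ramsey`, `max_clique` and `complement` are hypothetical, and the adjacency-dictionary representation is our own choice for the illustration.

```python
def max_clique(adj):
    """Size of a largest clique; adj maps each vertex to its set of neighbours."""
    best = 0
    def grow(size, cand):
        nonlocal best
        best = max(best, size)
        cand = set(cand)
        while cand and size + len(cand) > best:
            v = cand.pop()
            grow(size + 1, cand & adj[v])
    grow(0, set(adj))
    return best

def complement(adj):
    return {v: set(adj) - adj[v] - {v} for v in adj}

def clique_to_ramsey(adj, k):
    """Join G with a clique C on n + 1 vertices and return (G', k + n + 1)."""
    n = len(adj)
    new = {("g", v): {("g", w) for w in adj[v]} for v in adj}
    originals = list(new)
    extra = [("c", i) for i in range(n + 1)]
    for u in extra:
        new[u] = (set(extra) - {u}) | set(originals)
    for w in originals:
        new[w] |= set(extra)
    return new, k + n + 1

c5 = {i: {(i - 1) % 5, (i + 1) % 5} for i in range(5)}   # omega(C5) = 2
g_a, k_a = clique_to_ramsey(c5, 2)   # (C5, 2) is a yes-instance of Clique
g_b, k_b = clique_to_ramsey(c5, 3)   # (C5, 3) is a no-instance of Clique
omega, alpha = max_clique(g_a), max_clique(complement(g_a))
print(omega, alpha, k_a, k_b)
```

Since $\alpha(G') \le n < k + n + 1$, only cliques can witness a yes answer for the output instance, which is what makes the two instances equivalent.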



3 Lower bounds for kernelization 

In this section we briefly recall the relevant results and definitions required to obtain our lower bound result. The main tool is the following lemma due to Dell and van Melkebeek [7]. Before stating the lemma, we recall their definition of an oracle communication protocol.

Definition 1 ([7]). An oracle communication protocol for a language $L$ is a communication protocol for two players. The first player is given the input $x$ and has to run in time polynomial in the length of the input; the second player is computationally unbounded but is not given any part of $x$. At the end of the protocol the first player should be able to decide whether $x \in L$. The cost of the protocol is the number of bits of communication from the first player to the second player.

Lemma 2 ([7]). Let $L$ be a language and $t : \mathbb{N} \to \mathbb{N} \setminus \{0\}$ be polynomially bounded such that the problem of deciding whether at least one out of $t(s)$ inputs of length at most $s$ belongs to $L$ has an oracle communication protocol of cost $O(t(s) \log t(s))$, where the first player can be co-nondeterministic. Then $L \in$ coNP/poly.

It is an easy consequence of Lemma 2 that co-nondeterministic compositions lead to kernelization lower bounds. Being one of many other applications this extension is not made explicit by Dell and van Melkebeek [7] (though deterministic compositions are discussed), but their work motivated our search for a co-nondeterministic composition. Somewhat surprisingly, from sketching a proof for self-containment, it turns out that Lemma 2 not only permits co-nondeterminism. In fact, compositions with a dependence of $t^{o(1)}$ on $t$ can be shown to still exclude polynomial kernels (in [4] only a factor polynomial in $\log t$ is permitted for cross-compositions, and it comes from a different argument). Hermelin and Wu [14] gave a similar (if less explicit on coNP) proof for their notion of weak composition where $k' = t^{1/d} k^{O(1)}$, showing that it excludes kernels of size $O(k^{d-\epsilon})$. Their proof also allows $k' = t^{1/d + o(1)} k^{O(1)}$.

We first give a definition of the version of composition that we are going to use. 

Definition 2. Let $Q \subseteq \Sigma^* \times \mathbb{N}$. A co-nondeterministic polynomial-time algorithm $C$ is a coNP-composition for $Q$ if there is a polynomial $p$ such that on input of $t$ instances $(x_1, k), \dots, (x_t, k) \in \Sigma^* \times \mathbb{N}$ the algorithm $C$ takes time polynomial in $\sum_{i=1}^{t} |x_i|$ and outputs on each computation path an instance $(y, k') \in \Sigma^* \times \mathbb{N}$ with $k' \le t^{o(1)} p(k)$ and such that the following holds:

• If at least one instance $(x_i, k)$ is a yes-instance then all computation paths lead to the output of a yes-instance $(y, k')$.

• Otherwise, if all instances $(x_i, k)$ are no-instances, then at least one computation path leads to the output of a no-instance.

We require the following notion of Bodlaender et al. [2] to state our lemma: The unparameterized version $\tilde{Q}$ of a parameterized problem $Q$ is defined as $\tilde{Q} := \{x \# 1^k \mid (x, k) \in Q\}$. It is essentially the same as $Q$ except for the unary encoding of the parameter value, affecting its classical complexity.

Lemma 3. Let $Q \subseteq \Sigma^* \times \mathbb{N}$ be a parameterized problem such that its unparameterized version $\tilde{Q}$ is NP-hard. If $Q$ has a coNP-composition then it does not admit a polynomial kernelization unless NP $\subseteq$ coNP/poly and the polynomial hierarchy collapses to its third level.



Proof. Assume that $Q$ admits a polynomial kernelization $K$ with polynomially bounded size $h$, say $h(k) = O(k^c)$. Furthermore, let $C$ be a coNP-composition for $Q$ which outputs instances with parameter bounded by $t^{o(1)} k^d$. We define a polynomially bounded function $t$ by $t(N) := N^{cd+2}$. By Lemma 2 it suffices to provide an oracle communication protocol for the unparameterized version $\tilde{Q}$ where the first player is co-nondeterministic and with cost $O(t(N) \log t(N))$ for $t$ inputs each of size at most $N$.

Fixing $N$ and $t := t(N)$, let $t$ instances each of size at most $N$ be given to the first player, say $x_1, \dots, x_t$. Let $(x_1, k_1), \dots, (x_t, k_t)$ denote the corresponding parameterized instances of $Q$.

Let us go through the protocol, but consider only the communication cost (for now). By definition of $\tilde{Q}$ it follows that all $k_i$ are bounded by $N$. The first player groups the instances by parameter value (at most $N$ groups), and applies the co-nondeterministic composition to each group. In each computation path this gives $r \le N$ instances $(G'_1, k'_1), \dots, (G'_r, k'_r)$. Let us bound the parameter values $k'_i$, assuming that $(G'_i, k'_i)$ was obtained by composing all instances with parameter value $k$:

$$k'_i \le t^{o(1)} k^d \le t^{o(1)} N^d.$$

Now the first player applies the assumed polynomial kernelization to each instance $(G'_i, k'_i)$. Then he sends the obtained kernels to the second player, who tests membership in $Q$ for each one. The second player answers yes if at least one of the instances sent to him is yes, and no otherwise.

Each kernelized instance has size at most $h(k'_i) = O((k'_i)^c) = O((t^{o(1)} N^d)^c)$. Thus we can bound the cost of sending the at most $N$ kernelized instances to the second player as follows:

$$O(N \cdot (t^{o(1)} N^d)^c) = O(N^{cd+1} \cdot (t^{o(1)})^c) = O(t),$$

using that $t = N^{cd+2}$.

It remains to show correctness, in particular taking into account the co-nondeterministic behavior of the composition. If at least one input instance $x_i$ is a yes-instance, then the corresponding instance $(G'_j, k'_j)$ will be yes on each computation path. Thus the oracle will answer yes on each computation path. Otherwise, if all instances are no, then there must be at least one computation path in which all (at most $N$) runs of the coNP-composition return no-instances. Applying the kernelization will thus create only no-instances as well (but note that a coNP-kernelization would suffice). These are then sent to the oracle, causing it to answer no (on at least one path).

Thus, assuming a polynomial kernelization for $Q$, we get an oracle communication protocol for deciding the OR of $t$ instances of $\tilde{Q}$ of cost $O(t)$. By Lemma 2 this implies that $\tilde{Q}$ is contained in coNP/poly, and hence, by NP-hardness of $\tilde{Q}$, that NP $\subseteq$ coNP/poly. □

4 The embedding construction 

In this section we will describe the embedding to be used in the composition algorithm once a suitable host graph is found. Given $t$ instances of Ramsey($k$), the construction requires a host graph $H$ with at least $t$ vertices. Furthermore, an integer $\ell$ must be provided such that $H$ neither contains a clique nor an independent set of size greater than $\ell$, but also such that each vertex of $H$ is contained in an independent set or a clique of size exactly $\ell$. The magnitude of $\ell$ in comparison to the magnitude of $t$ plays a crucial role for the quality of our construction.

We emphasize that the requirements on the host graph are loosened slightly compared to the example of Section 1. We achieve this by embedding each instance first in another local structure then to be embedded in the host graph.



Given a host graph $H$ on $t'$ vertices and graphs $G_1, \dots, G_t$ with $t \le t'$ we construct a graph $G' = \mathrm{Embed}(H, k; G_1, \dots, G_t)$, the embedding of the graphs $G_i$ into the graph $H$, as follows. We use the dummy graph $D_c$ that is defined as the join of a $(c - 2)$-clique with an independent set of size $c - 1$. Note that $\alpha(D_c) = \omega(D_c) = c - 1$. Now, assign each instance $(G_i, k)$ to a unique vertex of $H$. By possibly repeating instances we achieve that each vertex of $H$ is assigned an instance. For each assignment of an instance $G_i$ to a vertex $v$ of $H$ create a local graph $H_v$ obtained by joining a copy of $G_i$ to a copy of $D_k$, joining a copy of the complement $\overline{G_i}$ to another copy of $D_k$, and then forming the disjoint union of the two joins; the index $k$ is chosen so that each dummy copy contributes cliques and independent sets of size exactly $k - 1$. Finally, to obtain $G'$, we connect all graphs $H_v$ according to the adjacency in $H$: We fully connect $H_v$ and $H_{v'}$ if and only if $v$ and $v'$ are adjacent vertices of $H$.

The fact that we may obtain different embeddings, by assigning the instances in a different 
fashion to the vertices of the host graph will be irrelevant for our purposes. 
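The embedding can be exercised on a very small host graph. The sketch below is our own illustration and rests on stated assumptions: graphs are adjacency dictionaries, instances are assigned to host vertices cyclically, and the dummy graph is taken to be the one with $\alpha = \omega = k - 1$ (that is, $D_k$ under the definition of $D_c$ above), which is our reading of what the sizes $2k - 2$ in the analysis require. All function names are hypothetical.

```python
def max_clique(adj):
    """Size of a largest clique; adj maps each vertex to its set of neighbours."""
    best = 0
    def grow(size, cand):
        nonlocal best
        best = max(best, size)
        cand = set(cand)
        while cand and size + len(cand) > best:
            v = cand.pop()
            grow(size + 1, cand & adj[v])
    grow(0, set(adj))
    return best

def complement(adj):
    return {v: set(adj) - adj[v] - {v} for v in adj}

def dummy(c):
    """D_c: join of a (c-2)-clique with an independent set of size c-1."""
    clique = [("q", i) for i in range(c - 2)]
    indep = [("s", i) for i in range(c - 1)]
    adj = {u: (set(clique) - {u}) | set(indep) for u in clique}
    adj.update({u: set(clique) for u in indep})
    return adj

def join(a, b):
    """Disjoint copies of a and b (tagged 0 and 1) plus all edges between them."""
    adj = {(0, v): {(0, w) for w in a[v]} | {(1, w) for w in b} for v in a}
    adj.update({(1, v): {(1, w) for w in b[v]} | {(0, w) for w in a} for v in b})
    return adj

def local_graph(g, k):
    """Disjoint union of (G join D_k) and (complement of G join D_k)."""
    parts = [join(g, dummy(k)), join(complement(g), dummy(k))]
    return {(p, v): {(p, w) for w in part[v]}
            for p, part in enumerate(parts) for v in part}

def embed(host, k, graphs):
    """Place one local graph per host vertex; join them along host edges."""
    locs = {h: local_graph(graphs[i % len(graphs)], k) for i, h in enumerate(host)}
    adj = {(h, v): {(h, w) for w in loc[v]} for h, loc in locs.items() for v in loc}
    for h in host:
        for h2 in host[h]:
            for v in locs[h]:
                adj[(h, v)] |= {(h2, w) for w in locs[h2]}
    return adj

def homogeneous_number(adj):
    return max(max_clique(adj), max_clique(complement(adj)))

host = {0: {1}, 1: {0}}                                  # a single edge, ell = 2
path = {0: {1}, 1: {0, 2}, 2: {1}}                       # no-instance for k = 3
tri_iso = {0: {1, 2}, 1: {0, 2}, 2: {0, 1}, 3: set()}    # yes-instance for k = 3
ell, k = 2, 3
no_case = homogeneous_number(embed(host, k, [path, path]))
yes_case = homogeneous_number(embed(host, k, [tri_iso, path]))
print(no_case, yes_case, ell * (2 * k - 2) + 1)
```

The single-edge host satisfies the requirements with $\ell = 2$: both vertices lie in a clique of size 2 and there is no clique or independent set of size 3. The two printed values can then be compared against the threshold $\ell \cdot (2k - 2) + 1$.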

Lemma 4. Let $H$ be a host graph on $t'$ vertices and $(G_1, k), \dots, (G_t, k)$ legal inputs with $t \le t'$ for Refinement Ramsey($k$). If every vertex of $H$ is contained in a clique of size $\ell$ or an independent set of size $\ell$ but $H$ neither contains a clique nor an independent set of size $\ell + 1$, then $\mathrm{Embed}(H, k; G_1, \dots, G_t)$ has a clique or an independent set of size $\ell \cdot (2k - 2)$. Furthermore, it contains a clique or an independent set of size $\ell \cdot (2k - 2) + 1$ if and only if $(G_i, k)$ is a yes-instance for some $i \in \{1, \dots, t\}$.

Proof. It is easy to see that the local structures $H_v$ from the embedding construction contain both cliques and independent sets of size $2k - 2$. Furthermore, if an instance $(G_i, k)$ is a yes-instance, then the corresponding local graph $H_v$ contains both an independent set and a clique of size $2k - 1$ (in both cases using $k - 1$ vertices from one of the two copies of the dummy graph).

Suppose $V' \subseteq V(H)$ forms a clique of size $\ell$ in $H$. We can choose a clique of size $2k - 2$ in every local graph $H_v$ that is assigned to a vertex $v \in V'$. The union of these cliques forms a clique of size $\ell \cdot (2k - 2)$ in $\mathrm{Embed}(H, k; G_1, \dots, G_t)$. The analogous statement is true for independent sets.

Since every vertex in $H$ is contained in a clique or an independent set of size $\ell$, if some $(G_i, k)$ is a yes-instance, then we can choose a clique or an independent set of size $2k - 2 + 1$ in $H_v$, where $v$ is the vertex of $H$ to which $(G_i, k)$ is assigned, and thus in total we obtain a clique or an independent set of size $\ell \cdot (2k - 2) + 1$ in $\mathrm{Embed}(H, k; G_1, \dots, G_t)$.

Finally, if no instance $(G_i, k)$ is a yes-instance then no clique and no independent set in the graph $\mathrm{Embed}(H, k; G_1, \dots, G_t)$ can contain more than $2k - 2$ vertices from the same local graph $H_v$. Since no clique or independent set in $\mathrm{Embed}(H, k; G_1, \dots, G_t)$ can contain vertices from more than $\ell$ different local graphs, $\mathrm{Embed}(H, k; G_1, \dots, G_t)$ contains neither a clique nor an independent set of size $\ell \cdot (2k - 2) + 1$. □

5 A kernelization lower bound for Ramsey($k$)

In this section we derive our kernelization lower bound for Ramsey($k$). The main work lies in developing a co-nondeterministic composition algorithm for Refinement Ramsey($k$). Using the embedding construction of the previous section, this is centered around finding a suitable host graph. The following lemma about gaps between consecutive Ramsey numbers is required to ensure that such a graph can be found. We remark that a general result for additive or even multiplicative gaps that holds for any pair of consecutive (diagonal) Ramsey numbers is not known. All logarithms are base 2, and we take $\log t$ to be at least 1 for $t > 0$.
Algorithm 1 Compose

Input: $t$ instances $(G_1, k), \dots, (G_t, k)$ of Ramsey($k$)
Output: "yes" or an instance $(G', k')$ with $k' = O(\log t \cdot k)$.

1: If $k < 3$ then solve each instance in time $O(n^k) = O(n^2)$ and answer accordingly.
2: Guess integers $T \in \{1, \dots, (\lceil 8 \log t \rceil + 1) \cdot t\}$ and $\ell \in \{1, \dots, \lceil 8 \log t \rceil\}$.
3: Guess a host graph $H$ with $T$ vertices.
4: Guess $t$ vertex sets $A_1, \dots, A_t \in \binom{V(H)}{\ell}$, which are allowed to overlap.
5: Unless all $A_i$ induce independent sets or cliques and their union has size at least $t$, return yes.
6: Let $A'$ denote an arbitrary minimal choice of sets $A_i$ such that their union has size at least $t$.
7: Let $H' = H[A']$, the subgraph of $H$ induced by the union of the sets in $A'$.
8: Let $G' = \mathrm{Embed}(H', k; G_1, \dots, G_t)$.
9: return $(G', k')$ where $k' := \ell \cdot (2k - 2) + 1$.



Lemma 5. For every integer $t \geq 3$ there exists an integer $\ell \in \{1, \ldots, \lceil 8 \log t \rceil\}$ such that
$R(\ell + 1) > R(\ell) + t$.

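Proof. Assume that the statement of the lemma is not true, i.e., $R(\ell + 1) \leq R(\ell) + t$ for all
$\ell \in \{1, \ldots, \lceil 8 \log t \rceil\}$. Chaining these inequalities gives $R(\lceil 8 \log t \rceil + 1) \leq t \cdot \lceil 8 \log t \rceil + R(1)$.
We use Erdős' classical lower bound on the Ramsey numbers, which shows that $R(N) \geq 2^{N/2}$ for
all integers $N \geq 2$. This gives us $R(\lceil 8 \log t \rceil + 1) \geq 2^{\lceil 8 \log t \rceil / 2} \geq 2^{4 \log t} = t^4$. Combining the two
inequalities we get $t^4 \leq t \lceil 8 \log t \rceil + R(1)$, which is false for $t \geq 3$ since $R(1) = 1$. □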
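The last step of this proof rests on a purely numeric fact: $t^4$ exceeds $t \lceil 8 \log t \rceil + 1$ for every $t \geq 3$. The following is a minimal sanity check of that inequality over a sample range, not part of the original argument; the function name is hypothetical, and logarithms are taken base 2 as stated above.

```python
import math

def contradiction_holds(t: int) -> bool:
    """Check t^4 > t * ceil(8 * log2(t)) + R(1), using R(1) = 1 (hypothetical helper)."""
    ceil_term = math.ceil(8 * math.log2(t))
    return t ** 4 > t * ceil_term + 1

# The inequality holds throughout a sample range starting at t = 3 ...
assert all(contradiction_holds(t) for t in range(3, 10_000))
# ... and fails at t = 2 (16 versus 17), which matches the requirement t >= 3.
assert not contradiction_holds(2)
```

A finite range is of course no proof; it only illustrates why the threshold sits at $t = 3$.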

We now give a co-nondeterministic algorithm Compose (see Algorithm 1) that, given $t$ instances
$(G_1, k), \ldots, (G_t, k)$ of Refinement Ramsey($k$), will on each computation path return either the
answer yes or a single instance $(G', k')$ with $k' = O(\log t \cdot k)$. (The answer yes may be replaced by
any constant size yes-instance.) We will then show that Compose is a co-nondeterministic composition
for Refinement Ramsey($k$). As usual, "guessing" some integer or structure in the algorithm
corresponds to a (co-)nondeterministic branching of the computation into one independent path
for each possible value that the integer can take or possible structure that can occur.
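To make the role of the guesses concrete, the deterministic work done on a single computation path (Steps 5, 6 and 9 of Algorithm 1) can be sketched as follows. This is an illustrative sketch under assumptions: graphs are adjacency dictionaries, "minimal" is read as inclusion-minimal, all function names are hypothetical, and the Embed construction itself is not reproduced here.

```python
from itertools import combinations

def is_homogeneous(adj, vertices):
    """True if `vertices` induce a clique or an independent set in `adj`."""
    pairs = list(combinations(sorted(vertices), 2))
    edges = sum(1 for u, v in pairs if v in adj[u])
    return edges == 0 or edges == len(pairs)

def guess_is_valid(adj, sets, ell, t):
    """Step 5: all sets have size ell, are homogeneous, and cover >= t vertices."""
    if any(len(a) != ell or not is_homogeneous(adj, a) for a in sets):
        return False
    return len(set().union(*sets)) >= t

def minimal_subfamily(sets, t):
    """Step 6: drop sets one by one while the union keeps size >= t."""
    chosen = list(sets)
    i = 0
    while i < len(chosen):
        rest = chosen[:i] + chosen[i + 1:]
        if len(set().union(*rest)) >= t:
            chosen = rest
        else:
            i += 1
    return chosen

def target_parameter(ell, k):
    """Step 9: k' = ell * (2k - 2) + 1."""
    return ell * (2 * k - 2) + 1

# Toy host graph: the 5-cycle, which has no clique or independent set of size 3.
cycle5 = {v: {(v - 1) % 5, (v + 1) % 5} for v in range(5)}
guessed = [{0, 1}, {2, 3}, {0, 2}]
assert guess_is_valid(cycle5, guessed, ell=2, t=4)
assert minimal_subfamily(guessed, t=4) == [{0, 1}, {2, 3}]
assert target_parameter(ell=2, k=3) == 9
```

On a path where `guess_is_valid` fails, the algorithm simply answers yes, which is harmless for a co-nondeterministic composition: only the paths with a valid guess need to produce a meaningful instance.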

Lemma 6. Compose is a co-nondeterministic composition for Refinement Ramsey($k$).

Proof. Let $t$ instances $(G_1, k), \ldots, (G_t, k)$ be given. W.l.o.g. $k \geq 3$, otherwise we can solve all
instances in deterministic polynomial time and answer accordingly. Assume for now that $t \geq 3$.

We will first consider the case that at least one input instance is yes. Clearly, it suffices to check
that all instances $(G', k')$ returned by the algorithm in Step 9 are yes too. We have $k' = \ell \cdot (2k - 2) + 1$.
If the host graph used for the embedding contains an independent set or a clique of size at least $\ell + 1$,
then using that each local structure contains both independent sets and cliques of size $2k - 2$ we
know that $G'$ contains such a set of size at least $(\ell + 1) \cdot (2k - 2) > \ell \cdot (2k - 2) + 1$; thus $(G', k')$
is yes. Otherwise, it follows from the cover with independent sets and cliques of size $\ell$ that $H'$
is a suitable host graph fulfilling the requirements of Lemma 4. It then follows from the lemma
that $(G', k')$ is yes.

The other case is that all input instances are no. It now suffices to show that the algorithm
finds a suitable host graph on at least one computation path. Lemma 4 then ensures that the
output $(G', k')$ is a no-instance.
Let $\ell$ denote the smallest positive integer such that $R(\ell + 1) > R(\ell) + t$. According to Lemma 5
we have that $\ell \leq \lceil 8 \log t \rceil$. Furthermore, by choice of $\ell$ it follows that $R(\ell) \leq (\ell - 1) t + R(1) \leq
\lceil 8 \log t \rceil \cdot t$. Thus for some choice of $T \in \{1, \ldots, (\lceil 8 \log t \rceil + 1) \cdot t\}$ and $\ell \in \{1, \ldots, \lceil 8 \log t \rceil\}$ guessed
by Compose it holds that $T = R(\ell) + t < R(\ell + 1)$. It follows that there exists a graph $H$ on $T$
vertices which contains neither a clique nor an independent set on $\ell + 1$ vertices. Thus in at
least one computation path of the algorithm such a graph $H$ will be found. Let us consider such
a computation path and the corresponding graph $H$. (If $t < 3$, then $R(3) = 6$ and $R(2) = 2$
guarantee that appropriate values of $T$ and $\ell$ are found.)

As $T = R(\ell) + t$ there must exist cliques and independent sets $A_i$, each of size $\ell$, which cover
at least $t$ vertices of $H$; this follows from the definition of Ramsey numbers: while there are at
least $R(\ell)$ uncovered vertices, the subgraph induced by the uncovered vertices must contain an
independent set or clique of size $\ell$. Clearly, $t$ sets $A_1, \ldots, A_t$ can be chosen in such a way that they
cover at least $t$ vertices. Hence, in one computation path such sets $A_1, \ldots, A_t$ are found in $H$.
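The covering argument is constructive in spirit: one can repeatedly pick a clique or independent set among the still uncovered vertices. The sketch below illustrates this greedy process with a brute-force search; it is only an illustration, since Compose guesses the sets instead of searching for them. The adjacency-dictionary representation and all names are assumptions.

```python
from itertools import combinations

def find_homogeneous(adj, candidates, ell):
    """Return some clique or independent set of size ell inside `candidates`, or None."""
    for subset in combinations(sorted(candidates), ell):
        pairs = list(combinations(subset, 2))
        edges = sum(1 for u, v in pairs if v in adj[u])
        if edges == 0 or edges == len(pairs):
            return set(subset)
    return None

def greedy_cover(adj, ell, t):
    """Pick disjoint homogeneous sets of size ell until at least t vertices are covered."""
    uncovered = set(adj)
    cover = []
    while len(adj) - len(uncovered) < t:
        found = find_homogeneous(adj, uncovered, ell)
        if found is None:
            break  # cannot happen while at least R(ell) vertices remain uncovered
        cover.append(found)
        uncovered -= found
    return cover

# Toy case: ell = 2, t = 3, and the 5-cycle as a host graph on R(2) + t = 5 vertices.
cycle5 = {v: {(v - 1) % 5, (v + 1) % 5} for v in range(5)}
cover = greedy_cover(cycle5, ell=2, t=3)
assert cover == [{0, 1}, {2, 3}]
assert len(set().union(*cover)) >= 3 and len(cover) <= 3
```

Each chosen set covers $\ell \geq 1$ new vertices, so at most $t$ sets are needed; if fewer are chosen, the remaining sets among the $t$ guessed ones can repeat earlier ones, since overlaps are allowed.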

Thus, from Step 7 we get a graph $H'$ on at least $t$ vertices which contains no independent set
or clique of size $\ell + 1$ but such that each vertex is contained in a clique or independent set of size $\ell$.
Hence, by Lemma 4 the graph $G' = \mathrm{Embed}(H', k; G_1, \ldots, G_t)$ has an independent set or a clique
of size at least $k' = \ell \cdot (2k - 2) + 1$ if and only if at least one graph $G_i$ contains an independent set
or clique of size at least $k$. We note that $k' = \ell \cdot (2k - 2) + 1$ is bounded by $t^{o(1)} \cdot k^{O(1)}$, completing
the proof. □
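For completeness, the bound on the output parameter can be spelled out as follows; this short derivation only uses the range of the guessed integer from Step 2 of Algorithm 1 and is a sketch, not a quotation from the original proof.

```latex
\begin{align*}
k' &= \ell \cdot (2k - 2) + 1 \\
   &\leq \lceil 8 \log t \rceil \cdot (2k - 2) + 1 \\
   &\leq (8 \log t + 1) \cdot 2k \\
   &= O(\log t \cdot k) \subseteq t^{o(1)} \cdot k^{O(1)}.
\end{align*}
```

The last containment holds because $\log t$ grows more slowly than any fixed positive power of $t$.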

Now, having the co-nondeterministic composition, the following theorem is an immediate consequence
of this composition and Lemma El. NP-hardness of the unparameterized version of Refinement
Ramsey($k$) follows from Theorem 1 using that nontrivial instances have $k < n$.

Theorem 2. Unless NP $\subseteq$ coNP/poly and the polynomial hierarchy collapses to its third level,
Refinement Ramsey($k$) admits no polynomial kernelization.

From Lemma 1 we get the desired lower bound for Ramsey($k$). For completeness we sketch this
argument as well (see Bodlaender et al. [5] for a more general version of transferring kernelization
lower bounds via NP-completeness and the implicit Karp reduction).

Corollary 1. Ramsey($k$) does not admit a polynomial kernelization unless NP $\subseteq$ coNP/poly.

Proof. Let $K$ be a polynomial kernelization for Ramsey($k$) with size bounded by a polynomial $h$.
It is easy to see that $K$ also gives a polynomial kernelization for Refinement Ramsey($k$):
Applying $K$ to any instance $(G, k)$ of Refinement Ramsey($k$) gives an equivalent instance $(G', k')$
of Ramsey($k$) with $|G'|, k' \leq h(k)$. Applying the reduction from Lemma 1 yields an equivalent
instance $(G'', k'')$ of Refinement Ramsey($k$) such that the size of this instance is polynomial in
the size of $(G', k')$ and with $k'' = k' + 1$. Thus from $K$ we also get a polynomial kernelization for
Refinement Ramsey($k$), implying that NP $\subseteq$ coNP/poly. □

6 Conclusion 

We have presented a co-nondeterministic composition for the Refinement Ramsey($k$) problem,
thereby showing that Ramsey($k$) and Refinement Ramsey($k$) do not admit polynomial
kernelizations unless NP $\subseteq$ coNP/poly. On a high level, the use of co-nondeterminism allowed us to
essentially guess an appropriate pattern in which to combine the given instances.
In conclusion we believe that the use of co-nondeterminism in compositions may help in resolving
whether other problems like, e.g., Multiway Cut and Directed Feedback Vertex Set
admit polynomial kernels. We mention in passing that, similarly to compositions, the use of
co-nondeterminism may also be of use for kernelization itself. While a polynomial coNP-kernelization
that crucially uses nondeterminism can hardly be seen as practical, it is of significant theoretical
interest. Indeed, a polynomial coNP-kernelization can be easily seen to exclude coNP-compositions as
well as weak compositions (the latter depending of course on the degree of the size bound), assuming
NP $\nsubseteq$ coNP/poly; the key point is that a coNP-kernelization together with a coNP-composition
gives an oracle communication protocol with co-nondeterministic first player.